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MICHAEL BATEMAN AND CHRISTOPH THIELE 

Abstract. Stein conjectured that the Hilbert transform in the direc- 
tion of a vector field is bounded on, say, L 2 whenever v is Lipschitz. 
We establish a wide range of L p estimates for this operator when v is a 
measurable, non- vanishing, one- variable vector field in R 2 . Aside from 
an L 2 estimate following from a simple trick with Carleson's theorem, 
these estimates were unknown previously. This paper is closely related 
to a recent paper of the first author (pi). 



1. Introduction 
Given a non- vanishing measurable vector field v: M. 2 — > M. 2 , define for 

H v f{x,y)=p.v. / dt . (1.1) 



In this paper we prove: 

Theorem 1. Suppose v is a non-vanishing measurable vector field such that 
for all x, y G M 

v(x,y) = v(x,0) , 
and suppose p € (|, oo). Then 

II^/|Ip<II/Hp • 

The estimate is understood as an a priori estimate for all / in an appro- 
priate dense subclass of L P (M?), say the Schwartz class, on which the Hilbert 
transform H v is initially defined. One can then use the estimate to extend 
H v to all of LP(R 2 ). 

If the vector field is constant, then this follows from classical estimates 
for the one dimensional Hilbert transform by evaluating the L p norm as an 
iterated integral, with inner integration in the direction of the vector field. 
Theorem [l] follows from the special case for vector fields mapping to vectors 
of unit length, because the Hilbert transforms along v and A are equal by 

a simple change of variables in ([O]). To prove the theorem for unit length 
vector fields, it suffices to do so for vector fields with non-vanishing first 
component, because we can apply the result for constant vector fields to the 
restriction of H v to the set where v takes the value (0, 1) and the set where 
it takes the value (0,-1). Dividing v by its first component we may then 

l 
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assume it is of the form (l,u(x)); note that multiplying v by a negative 
number merely changes the sign of (|1 , 1|) . We call u the slope of the vector 
field. The Hilbert transform ( |1 , 1|) then takes the form 

it t( \ f f(x ~t,y -tu(x)) 

H v f(x,y)=p.v. / dt . (1.2) 



1.1. Remarks and related work. The case p = 2 of Theorem [T] is equiv- 
alent to the Carleson-Hunt theorem in 1? . This observation is attributed 
(without reference) to Coifman in [|l4|] and to Coifman and El Kohen in ||. 
We briefly explain how to deduce Theorem [l] for p = 2 from the Carleson- 
Hunt theorem. Denote by Ti the Fourier transform in the second variable. 
Then we formally have for ( |1.2| ), ignoring principal value notation, 

e 2,i W j T2 f {x _ tiV) _ dtdr j . 

As the inner integral is independent of y, it suffices by Plancherel to prove 

/g— 1-niu{x)r\t 
Fzf{x - t,v) t dt \\ <\\^fh ■ 

Applying for each fixed r/ the Carleson-Hunt theorem in the form 

/ e -2iriN(x)t 
g(x-t) dt|| 2 < \\gh > 

for g 6 L 2 (M) and measurable function N proves the desired estimate. 
For any regular linear transformation of the plane we have the identity 

(H TovoT - 1 f)oT = H(foT) . 

The class of vector fields depending on the first variable is invariant under 
linear transformations which preserve the vertical direction. This symmetry 
group is generated by the isotropic dilations 

(x,y) -4 (Ax, Ay) , 

non-isotropic dilations 

(x, y) -4 (x, Ay) , 
and the shearing transformations 

(x,y) ->• (x,y + Xx) 

for A / 0. By a simple limiting argument, it suffices to prove Theorem |l| 
under the assumption that ||ii||oo i s finite. By the above non-isotropic scaling 
the operator norm is independent of H^Hocm and we may therefore assume 
without loss of generality that 

Nloc < 10~ 2 • (1.3) 

Following general principles of wave packet analysis, it is natural to de- 
compose H v into wave packet components, where the wave packets are ob- 
tained from a generating function (j) via application of elements of the sym- 
metry group of the operator. These wave packets can be visualized by acting 
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with the same group element on the unit square in the plane. The shapes 
obtained under the above linear symmetry group of H v are parallelograms 
with a pair of vertical edges. All parallelograms in this paper will be of this 



special type. Under the assumption (1.3) it suffices to consider parallelo- 



grams whose non-vertical edges are close to horizontal. Such parallelograms 
are well approximated by rectangles, which are used in [g] and previous work 
by Lacey and Li E3. 

The companion paper proves the following theorem: 

Theorem 2. A ssume Halloo — 1 and 1 <C p <C oo. Assume fi^^rj^ vanishes 
outside an annulus A < |(£,?7)| < 2A for some A > 0. Then 



\H v f\\ p 



Actually, the theorem is stated there for functions such that / vanishes 
outside a trapezoidal region inside an annulus, but this is inessential, as can 
be seen from the commentary below. This theorem is weaker than Theorem 
|l| in the region p > 3/2 but stronger in the region 1 < p < 3/2. The width 
of the annulus can be altered by finite superposition of different annuli, at 
the expense of an implicit constant depending on the conformal width of 
the annulus. The case p > 2 and a weak type endpoint at p = 2 of Theorem 



H are due to Lacey and Li 13] , and hold for arbitrary measurable vector 
fields. 

We reformulate Theorem § in a form invariant under the above linear 
transformation group. Note that the adjoint linear transformations of this 
group leave the horizontal direction invariant. 

Theorem 3. Assume 1 < p < oo. Assume f{i,rj) is supported in a hori- 
zontal pair of strips A < |jy| < 2 A for some A > 0. Then 

\\Hvf\\ P < \\f\\ P ■ 

To deduce Theorem ^ from Theorem [2] we use the non-isotropic dilation 
(x,y) — > (Xx,y) to stretch the annulus in £ direction until in the limit it 
degenerates to a pair of strips A < \rj\ < 2A. The restriction ||it||oo < X 1 
becomes void in the limit A — > 0. This proves Theorem ||. For the converse 
direction we use a bounded number of dilated strips to cover the annulus 
except for two thin annular sectors around the £-axis. It remains to prove 
bounds on functions supported in these sectors. For fixed constant vector 
v, the operator H v is given by a Fourier multiplier which is constant on two 
half planes separated by a line through the origin perpendicular to v. If 
IMloo < I; then this line does not intersect the thin annular sectors, and we 
have with the constant vector field (1,0): 

H v f(x,y) = H (m f(x,y) . (1.4) 

But -ff(i,o) is trivially bounded and this completes the deduction of Theorem 
from Theorem H. 
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Sharpness of the exponent | in Theorem |] is not known. In Remark |9| we 
mention a potential covering lemma that, when combined with the methods 
in this paper, would push the exponent down to |. The truth of this covering 
lemma is unknown, however. If / is an elementary tensor, 

f(x,y) = g{x)h(y) , 

then a similar calculation as above turns H v f into 

/„— 2 f Kiu{x)r\t 
g(x-t)- dtdr, . 

This expression can be read as a family of Fourier multipliers acting on h. 
Assuming the norm of h is normalized to \\h\\ p = 1, we can estimate the last 
display by 

g— 2niu(x)r]t 

g(x-t) dt\\ M p^\\ LP ^ , 

where M p {rj) denotes the operator norm of the Fourier multiplier acting on 
LP . By scaling invariance of the multiplier norm, the factor u(x) in the 
phase can be ignored. As shown in (||), multiplier norms are controlled by 
variation norms. Hence we may estimate the last display by 

g— 2-niv/t 

g(x-t) — - — dtllvr^HiP^) , 

provided 1 1 — - 1 < ~ . The bounds on the variation norm Carleson operator 

in |l6|] imply that for p > | and r > p' the last display is bounded by a 
constant times \\g\\ p - Hence the exponent in Theorem |] can be improved 
to | under the additional assumption that the function / is an elementary 
tensor. The authors learned this argument from Ciprian Demeter. Related 



multiplier theorems in [10], j9|] also show a phase transition at this exponent. 

The Hilbert transform along a one variable vector field has previously been 
studied by Carbery, Seeger, Wainger, and Wright in ||. There boundedness 
in LP for 1 < p is proved under additional conditions on the vector field. 

In a different direction, Stein has conjectured that a truncation of H v is 
bounded on L? under the assumption that the two-variable vector field v is 
Lipschitz with sufficiently small Lipschitz constant depending on the trun- 
cation. Stein's conjecture is related to a well-known conjecture of Zygmund 
on the differentiation of Lipschitz vector fields. Define 



M v f(x,y) = sup -i- / f((x,y) -v(x,y)t)dt. 



r-L 

o<l<i 2L J_ L 

Zygmund conjectured that M v is (say) weak-type (2,2) if ||u||oo is bounded 
and the Lipschitz norm ||Vu is small enough. Proving a weak-type esti- 
mate on this operator would yield corresponding differentiation results anal- 
ogous to the Lebesgue differentiation theorem, except the averaging takes 
place over line segments instead of balls. Estimates on M v are unknown 
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on any L p space, except for the trivial p = oo case, unless more strin- 
gent requirements are placed on v; for example, Bourgain Q proved M v is 
bounded on L p , p > 1 when v is real-analytic and the operator is restricted 
to a bounded domain. The corresponding result for the Hilbert transform is 
announced in [17], although the p = 2 case follows from work of Lacey and 
Li [HJ]. Previously the Hilbert transform case, in such a range of exponents, 
was only known (Q) under the additional assumption that no integral curve 
of the vector field forms a straight line. 

There is some history of using singular integral and time-frequency meth- 
ods to control positive maximal operators. See Lacey's bilinear maximal 
theorem ([fL?]]) or the extension [ 10 1 of Bourgain's return times theorem by 
Demeter, Lacey, Tao, and the second author. 

This paper is structured as follows: Section [2] contains the main approach: 
a separation of frequency space into horizontal dyadic strips and application 
of Littlewood-Paley theory in the second variable to reduce to some vector- 
valued inequality; this step uses the one- variable property of the vector field 
to ensure that the strips are invariant under H v . This fact was brought to 
our attention by Ciprian Demeter. The vector- valued inequality is proved by 
restricted weak-type interpolation, a tool that allows to localize the operator 
to some benign sets G and H and prove strong L 2 bounds on these sets. 

Section ||| gives the crucial construction of the sets G and H relying on 
two covering lemmas. One is essentially an argument by Cordoba and R. 
Fefferman Q, while the other is essentially an argument by Lacey and Li 

©■ 

Section |] outlines the proof of the L 2 bounds on the sets G and H using 
time-frequency analysis as in |2|. The operator that we estimate at this point 
is a refinement of the operator in |2|. We refer to the decomposition of this 
operator in [g] without recalling details. The terms in this decomposition 
satisfy Estimates ^ through which are also taken from Jp]. To complete 
the proof of Theorem [j], we need the additional Estimates ^1 and [2^, which 
depend on the sets G and H. These additional estimates are proved in 
Section ||, again with much reference to Q . 

Throughout the paper, we write x < y to mean there is a universal 
constant C such that x < Cy. We write x ~ y to mean x < y and y < x. 
We write 1_e to denote the characteristic function of a set E. 



2. Reduction to estimates for a single frequency band 

We fix the vector field v with the normalization fll.2j ) and assume bounded 
slope as in (|l.3|) . Let P c be the Fourier restriction operator to a double cone 
as follows: 

It suffices to estimate H V P C in place of H v because, similarly to (Q|), 
H v (l-P c )f(^ V ) = H ilfi) (l-P c )f^,r l ) , 
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due to the restriction on the slope of v. Define the horizontal pair of bands 

B k :={fo V )eR 2 : \ V \ E [2 k , 2 fc +w)} , 

and define the corresponding Fourier restriction operator P k f = ~\-B k f ■ Since 
the Hilbert transform in a constant direction is given by a Fourier multiplier, 
and the vector field v is constant on vertical lines, we can formally write for 
a family of multipliers parameterized by x: 

H v f(x,y)= J J m x (^ V )m V )e 2m ^ + ^ . 
Then it is clear that 
H v (P k f)(x,y) = |l [2ft)2fc+ ^ ) (7 7 ) e 2 ^[|m :r (^7 ? )/(C,r ? ) e 27r ^^]d7 ? 
= P k (H v f)(x,y) . 

Define 

Hk '■= PkH v P c = PkH v P c Pk . 
Littlewood-Paley theory implies 



Wc/ii p <[|( E \ H *f\' 

. fcez/ioo 



where here the summation is over integer multiples of ^ . Using Littlewood- 
Paley theory once more, it suffices to prove 



2 / \ 2 



E \ H k(Pkf)\ 2 ) it s ii E 

v fcez/ioo / \feez/ioo 

which follows from the more general estimate 



1 i 

2 / \ 2 



E \ H kfk\ 2 J iip^iil E i-^i 2 

ioo / Vfcez/ioo 



for any sequence of functions fk E L 2 . By a limiting argument, it suffices to 
prove for all ko > 



1 i 

2 / \ 2 



E i^n it ^ii E im 2 it (2- 1 ) 

J*|<fco / \|fc]<feo / 

with implicit constant independent of ko, where it is understood that k runs 
through elements of Z/100. Compare this inequality with a vector valued 
Carleson inequality as in [11]. 

Theorem (|3|) implies that H k is bounded in L p for 1 < p < oo for each 
k. In particular, ( |2.1| ) is true for p = 2 by interchanging the order of square 
summation and L 2 norm. 
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Note that is defined a priori on all of L p (by Theorem |J) and we 
may drop the assumption that / is in the Schwartz class. By Marcinkiewicz 
interpolation for I 2 vector valued functions it suffices to prove for G, H C R 2 
and E fc l/fcl 2 < Iff: 



H k f k A ,l G )\<\H\^\G\ 1 -p . (2.2) 



By Lebesgue's monotone convergence theorem it suffices to prove this under 
the assumption that G is supported on a large square [— N', N'] 2 as long as 
the implicit constant does not depend on N'. By another limiting argument 
using crude estimates in case the sets G and H have large distance it suffices 
to prove this under the assumption that H is supported in a much larger 
square [-N, N], again with bounds independent of N. Generalizing, we will 
only assume both G and H are supported on the larger square. 

Since we already have (|2.2j ) for p = 2, we immediately obtain this estimate 
for p > 2 provided \H\ < |G| and for p < 2 provided \G\ < \H\. By a 
standard induction on the ratio of \H\ and |G|, it then suffices to prove the 
following lemma. 

Lemma 4. Let G',H' C [— N, N] 2 be measurable and let | < p < oo. 

If p > 2 and 10|G'| < \H'\ then there exists a subset H C H' depending 



only on p, G' , and H' with \H\ > \H'\/2 such that §U\) holds with G = G' 
and any sequence of functions with X]|/c|<fc \fk\ 2 < Iff- 

If p < 2 and 10\H'\ < \G'\ then there exists a subset G C G' depending 
only on p, G' , and H' with \G\ > \G'\/2 such that ft2j\ ) holds with H = H' 

\k\<k 



and any sequence of functions fk with YlikKkn \fk\ 2 < Iff ■ 



For example in case p > 2 and 10|G'| < \H'\ we split H' into H and 
H'\H and apply the triangle inequality. On H' \ H we apply the induction 
hypothesis, which yields an estimate better than the desired one by a factor 
2~Vp because of the size estimate for H' \ H. On H we use the conclusion 
of the Lemma, which by choosing the induction statement properly we may 
assume to provide a bound no more than 1 — 2~ 1//p times the desired bound. 

By Cauchy Schwarz, (|2.2|) follows from 



\k\<k 

This in turn follows from 



J Y, \H k f k \ 2 lG<\H\$\G\ l -$ 



J £ \H k f k \H G <(^~ P [Y\fk\ 2 (2-3) 

\k\<k VII/ fe 

by the assumption on the sequence fk- Now define the operator Hk,G,H by 

Hk,G,Hf = lG#fc(lff/) • 
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Then (|2.3|) follows from the estimate 



\\Hk,G,Hfh ~ ( ThT ) "^' 2 ' 

for any / E L 2 , and < fco ; assuming the implicit constant does not 
depend on k or &q. We will prove this L 2 estimate again by Marcinkiewicz 
interpolation between weak type estimates. More precisely we will prove 

Theorem 5. Let p be as in Theorem g and let G',H' C M 2 6e as m Lemma 
[|. TTien f/iere are sets G, -ff as in Lemma ^ suc/j t/iai /or any measurable 
sets EjFcM 2 and each \k\ < /co we /iawe 

|(^, iGi h1f,1 S )| < (|||) 2 . (2.4) 

Note again that [|| proves 

\{H k>GiH l F ,l E )\ < iFli^l 1 -! (2.5) 

for all 1 < q < oo. The refinement we need here is the localization to G and 
H, with corresponding improvement in the estimate. We remark that the 
parameter k is irrelevant in proving ( |2,4D , but it is crucial that the sets H 
and G be constructed independent of k. By interpolating Theorem [| with 
(2.5) for q near 1 and oo we obtain strong type estimates 

-- 1 

\(H k ,G, H f,e)\<UQ 

where r is as close to p as we wish and q is in a small punctured neighborhood 
of 2 whose size depends on r. Another interpolation allows q to be 2 as well, 
and we obtain ( |2.3| ) with power r instead of p, which is no harm since we seek 
an open range of exponents. We have thus reduced Theorem [l] to Theorem 

I 

3. Construction of the sets G and H 

In this section we present the sets G and H of Lemmai and prove the 
size estimates \G\ > \G'\/2 and \H\ > \H'\/2. Inequality (2.4) will be proved 
in subsequent sections. 

We work with two shifted dyadic grids on the real line, 

T X = {^(n + ^^+l + ^i^eZ} , 

Z 2 = {[2 l ( n -^),2 t (n+l-^)):MG2} • 
The exceptional sets will be the union of two sets: 

H'\H = FiUff 2 , 
G'\G = G\ U G2 • 
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Fix i € {1,2}. The sets Hi and Gi will be constructed using the grid Zj, and 
we will prove A\H % \ < \H'\ and A\d\ < \H'\. 

Given a parallelogram with two vertical edges, we define the height H(R) 
of the parallelogram to be the common length of the two vertical edges. 
We define the shadow I(R) to be the projection of R onto the x axis. The 
central line segment of R is the line segment which connects the midpoints 
of the two vertical edges. If a line segment can be written 



then we call u the slope of the line segment. For each parallelogram R let 
U (R) be the set of slopes of lines which intersect both vertical edges. Note 
that maximal and minimal slopes in U(R) are attained by the diagonals 
of the parallelogram. Hence U(R) is an interval of length 2H(R)/\I(R)\ 
centered at the slope of the central line of R. 

For an interval U and a positive number C define CU to be the interval 
with the same center but length C|f/|. If R is a parallelogram, define CR 
to be the parallelogram with the same central line segment as R but height 
CH(R) (this definition of CR is used in Section [| only). Note that CU(R) = 
U(CR). For an interval I C I{R) define 

Ri = R n (I x R) . 

Given N and ko as in Lemma ^j, we consider a finite set IZi of parallel- 
ograms R as follows: the projection of both vertical edges of R onto the 
y-axis are in X\ DI2, and I(R) € Xj. Further, the parallelogram is contained 
in the square [— 10 2 iV, 10 2 N] 2 , the height is at least 2~ k °, and the slope is at 
most 10" 1 . These assumptions imply also that is also at least 2~ k °. 

We will use the following simple geometric observation: 

Lemma 6. Let R,R' be two parallelograms and assume I(R) = I(R'), 
U(R)nU(R') ^ $, RDR' / 0, and without loss of generality H(R) < H(R'). 
Then we have R C 7R'. Moreover, if7H(R) < H(R'), then 7R C 7R'. 

Proof. Since U(R)DU (R') 7^ 0, there exist two parallel lines, one intersecting 
both vertical edges of R and the other intersecting both vertical edges of 
R'. Since Rf) R' 7^ 0, the vertical displacement of these lines is less than 
H{R) + H(R'). If H(R) < H(R'), then the vertical edges of R have distance 
at most 2H(R') from the respective vertical edges of R' and are contained 
in the vertical edges of 7R'. This proves the first statement of the lemma. 
The second statement follows similarly. □ 

Let My denote the Hardy Littlewood maximal operator in vertical direc- 
tion: 



where the supremum is taken over all intervals J containing y. For a mea- 
surable function u : R — > R (which will be the slope function associated with 



{(x, y) : x G I(R) : y = ux + b} , 



M v f{x, y) = sup — - / \f(x,z)\dz 
yeJ \ J \ J J 
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we 



1-Q 



the given vector field), define 

E(R) := {(x,y) G R : u(x) £ U{R)} . 

3.1. Construction of the set H. With the sets G',H' as in Lemma | 
define 

Hi = \J{R € TZi : \E(R) n G'\ > S\R\} 

with 

for some small a to be determined later through application of Estimate 22 
and some constant C a large enough so that the desired estimate 4|iTj| < 
\H'\ follows from the following lemma, applied with G = G', q = j^t- 
Note that we are essentially eliminmating all rectangles R with large density 
parameter, where density has the meaning from Q. This will be used in the 
proof of Estimate [2^ later in the paper. Essentially, trees with density > 5 
will have extremely small size, and will therefore be mostly negligible. 

Lemma 7. Let 5 > and q > 1 and let G C M 2 be a measurable set 
and u : R — > R be a measurable function. Let 1Z be a finite collection of 
parallelograms with vertical edges and dyadic shadow such that 



\E(R)DG\ > S\R\ 



for each R £ 1Z. Then 



U 

Ren 



<5~ q \G\ 



Proof. We will find a subset Q C 1Z such that 

i U R \ 

Ren 



Reg 



E 



Reg 



\R\ 



\R\ 



Reg 

Inequality (|3.1| ) will complete the proof of Lemma [?] provided 

T,\R\<6-*\G\ . 



Reg 

But with the density assumption for the parallelograms in 1Z we have 

\R\<J2-MR)nG\ 



(3.1) 
(3.2) 

(3.3) 



E 

Reg 



Reg 



< 




G\ 1/q 



where in the last line we have used Holder's inequality and (3.2). After 
division by the middle factor of the right hand side we obtain (3.3). 
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The following argument is essentially the one used in to prove end- 
point estimates for the strong maximal operator. We select parallelograms 
according to the following iterative procedure. Initialize 

STOCK = K 

g = 

= 0. 

While STOCK / 0, choose an R G STOCK with maximal \I(R)\. If 



\7Rn7R'\>W- 2 \R\ 

R'eG: E(R)nE(R')^<b 



(3.4) 



then update 



Otherwise update 



STOCK 
G 
B 

STOCK 
G 
B 



STOCK \ R 
G 

Bu{R} . 
STOCK \ R 

gu{R} 

B . 



It is clear that this procedure yields a partition 1Z = Q U B. 

To prove (|3~T|) , let R G B and let R' be in the set G{R) of all elements in Q 
which are chosen prior to R and satisfy E(R)DE(R') ^ 0. The last property 
implies U(R) n U(R') ^ and R n R' / 0. Note also that I(R) C I(R'). By 
Lemma |^ applied to R and R'l^y we have for every vertical line L through 
the interval I{R)'- 

\LD7Rn 7R'\ > mm(H(R),H(R>) > • 

Comparing for (x, y) £ R and corresponding vertical line L the maximal 
function My with an average over the segment L n 7R we obtain: 

M y ( ^ l 7i? /)(x,y) > 7~ 1 H{R)- 1 ^ |Ln7.Rn7i2'| 

R'eGiR) R'eG(R) 

> 49" 1 |i?r 1 ^ \7Rr\7R'\ > 10" 4 , 

i?'e5(i?) 

where the last estimate followed from (13.41). Hence 



RGB 



<\{x: M v (^l fl )(x)>10- 4 }|<^|it!| 

by the weak (1, 1) inequality for My. This proves Q3.1| ), because the corre- 
sponding estimate for the union of elements in Q is trivial. 
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To prove consider R' , R G Q with E(R) n E(R') / 0. If # was 

selected first, then H(R) > 7H(R'), for otherwise we can use Lemma || as 
above to conclude for (x, y) E R 

M v (l 7R >)(x,y) > 7" 1 |i7(i?)r 1 \LH7Rn7R'\ > 49" 1 , 

R'£G{R) 

and hence R would have been put into B. Hence we have by Lemma |6| 

7i?j C 7i? 7 (3.5) 

for every I C I(R)- Hence 

\7R' I n7R I \= Y \ 7R 'i\ 

R'eG(R) R'eG(R) 

is proportional to |/| for I C I(R)- Hence we have for all such I 

]T |7i^n712j|<|fl/|, (3.6) 
R'eG(R) 



since for I = I(R) this holds when condition ( |3.4[) fails. 

Let's say an n-tuple (R , R 2 , . . . , R n ) of elements in Q is admissible if Ri 
is selected after Ri +l for each j and E(R^) n E{Ri +l ) ^ 0. Then we have 

/ { E ^w) s E w 1 ) n s ( fl2 ) n • • • n 

{R 1 ,R? ,...,R n ) adm. 

< ^ |7i? 1 n7i? 2 n...7i? n |. 

{R 1 ,R? ,...,R n ) admi. 

< £ |7i? 1 n7^ (i?l) n---n7^ (Jli) | . 

(iJ 1 ,/? 2 ,...,/?™) adm. 

Using ( |3.5[) , which implies that the sets 7-Rj/^s are nested, and the estimate 
( p.6| ) for the last pair of sets, we can estimate the last display by 

< £ |7i? 1 n7i?f (iil) n---n7^ ( - 1 i) | . (3.7) 



Iterating the argument allows us to conclude (3.2) for q' an integer, which is 
clearly not a restriction as the estimate is harder for larger q'. This completes 
the proof of Lemma |7[ □ 

3.2. Construction of the set G. Let G',H',u be as in Lemma |I] and 
define 

Gl = |J {^GTe^M^^^and^^^^-^^) 1 } 
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for some small e > to be determined later through application of Estimate 
21 and some constant C e large enough so that we obtain with Theorem || 
below: 

Gi\ < V C2- fc [ C f 2-^ + ^ ( SV^ \H'\ 



E 

fcez,fc<o 



G" 




< 



This construction essentially allows us to ignore trees with size and density 



both too large. This will be used in the proof of Estimate |21. 

The following theorem is a variant of the result in . The theorem there 
is valid for arbitrary Lipschitz vector fields. As stated here, the theorem is 
valid for vector fields depending on one variable. In fact, the theorem holds 
for vector fields that are Lipschitz in the vertical direction only. We recreate 
the proof given in p5l below in the one-variable case. The only use of the 



one-variable property comes in the proof of Lemma 12 below. 



Theorem 8. Let < S, a < 1, let H be a measurable set, and let 1Z be a 
finite collection of parallelograms with vertical edges and dyadic shadow such 
that for each R £ 1Z we have 

\E(R)\ > 5\R\ , \HnR\> o-\R\ . 

Then 



Ren 



\H\ 



Remark 9. It is of interest whether a result like Theorem [| holds with em- 
power less than 2. In the single height case, optimal results are already known 



with power all the way to 1 + e; see 



that the parallelograms in Theorem 18 can have arbitrary height, which is 
necessary for creating the exceptional sets needed in the current paper. 



However the important point is 



Proof. It is enough to find a subset Q C 7Z such that 



IN 



Ren 



Reg 



< 



< 



E 

Reg 



R\ 



Reg 



R\ 



(3.8 



(3.9) 



Namely, we have with 



]T \R\ < a' 1 I ^2l R (x)l H dx 

Reg J Reg 

< a- 1 \\H\\ 1 2(l'(J2^R(x)) 2 dx^ 
J Reg 



Reg 



and the desired estimate follows from (13.81). 
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We define the set Q by a recursive procedure. Initialize 

Q <- 0, 
STOCK <- p . 

While STOCK is not empty, select -R G STOCK such that is maxi- 

mal. Update 

a <- g u {i?} , 

B ^ {R' e STOCK : R' C {x : My(^ l fl )(x) > 10~ 3 }} , 

Reg 

STOCK <- STOCK \B 

This loop will terminate, because the collection P, is finite and we remove 
at each step at least the selected i? from STOCK. 

By the Hardy Littlewood maximal bound, it is clear that ( |3.8| ) holds 
and it remains to show ([O]). By expanding the square in ( |3.9| ) and using 
symmetry it suffices to show 

(R,R')eV Reg 

where P is the set of all pairs (i?, R') £ G x Q with i? n i?' / and i? is 
chosen prior to R'. We partition P into 

P' = {(R,R r ) G P : U(R) (jL 10 2 U(R')} , 
P" = {(R,R r ) £V :U(R) Cl0 2 U(R')} . 

Theorem || is reduced to the following two lemmas: □ 

Lemma 10. For fixed R' £ G we have 

]T \RnR'\<\R'\. 
Ren,(R,R')ev" 

Lemma 11. For fixed R € Q we have 

Y \Rn&\ < 5- 1 | J R| . 
R'en,(R,R')eV' 

Proof of Lemma f0[ We first argue by contradiction that P" does not 
contain a pair (R,R') with H(R') < H(R). By definition of P" we have 
[/(i?) n C/(100i?') / 0. By Lemma §, applied to lOOifyj?) and 10012', we 
conclude that R' is contained in 700-R. But then 

R! C {Myl R > 1/700} , 

which contradicts the selection of R' and completes the proof that we have 
H(R) < H(R') for all (R, R') G P". 
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Now we use Lemma |6] again to conclude that for each (R, R r ) G V" we 
have Ri(R') C 7002?'. Hence we have for some point (x, y) in R' 

10- 3 > m v ( E ^(^y) 

Reg-.{R,R')eV" 
~ 7002/(2?') ^ H(R) 



- 7nn ^ 1 1/ 1 1 



700 

R:(R,R')eV" 



This proves Lemma 1C. 

It remains to prove Lemma 11. Fix R G Q. We decompose {R' : (R,R') G 
■p'} by the following iterative procedure: Initialize 

STOCK <- {# : (22, 2?') G 7"} , 

£' <- 0. 

While STOCK is non-empty, select i?' G STOCK with maximal |Jr>|. 
Update 

g' <- g' u {r'} , 

B(R') <- {R" G STOCK : UE(R") n ILE(ii') ^ 0} , 

STOCK <- STOCK \B(R') , 

where IT denotes the projection onto the x axis. By construction, the sets 
HE(R') with 22' G Q' are disjoint and we have 

E M^ -1 E |ns(H')|<r 1 |/(fl)|. 

As the sets B(R') with 72' G £7' partition the summation set of the left-hand- 
side of Lemma [llj, it suffices to show for each R' G Q 1 

R"eB(R') 
In what follows we fix i?' G 

Lemma 12. There is an interval U (depending on R and R' ) of slopes with 

5\U(R)\ < \U\ , (3.10) 

U(R)r\5U = , (3.11) 

U(R) C 6U , (3.12) 

17(22") C U (3.13) 

for all R" C B(R'). 

Proof. We distinguish two cases: 

(1) \U(R)\ < \U(R')\ 

(2) \U(R)\ > \U(R>)\. 
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In the first case we use the definition of V' to conclude 

U(R) n 25U(R') = . 

We then define U = KU(R') where K > 5 is the largest number (or very 
close to that) such that U(R) n 5KU(R') = 0. Then we have immediately 
( |3.10D , ( |3.11| ) and ( 3.12[ ). To see ( |3.13| ) assume to get a contradiction that 
U(R") t U. 

By construction of B(R'), we know that U(E(R"))nn(E(R')) ^ 0, which 
implies that U (R")nU (R') ^ since the underlying vector field v is constant 
along vertical lines. Since U(R') is contained in the middle fifth of the 
interval U, we conclude \U\ < 3\U(R")\ and U C 7U(R"). But then U(R) C 
W 2 U(R"), a contradiction to (R, R") € V. 

In the second case we have H(R) > H(R') because \I(R')\ < \I(R)\. Since 
R' is not contained in the set {My 1r > 10~ 3 } and thus not in 10 3 ii, we 
conclude that U(R') contains an element not in 400U(R). Hence 

\U(R')\ U[ K) 

does not intersect U(R). From there we may proceed as before with U(R') 
replaced by this bigger interval. This completes the proof of Lemma |l2|. □ 

Lemma 13. Let I be a dyadic interval contained in Iri . Then for all R" € 
B(R') with H(R") < 20\U\\I\ we have that 

i?jni?'V0 => R"i c 50(1 + \u\\i\H(Ry 1 )R (3.14) 

and 

\R I nR"\<10\U\~ 1 H(R")H(R) . (3.15) 

Proof: By a shearing transformation and translation we may assume that 
the central line segment of R is on the x axis. 

Statement ( 3.14[ ) follows immediately from the central slope of R" being 
less than W\U\ and H(R") < 20\U\\I\, and hence the vertical distance of 
any point in R" from R is at most 50[[/||/|. To see the second statement, 
note that the central slope no of R" is at least 2\U\. Hence ( |3.15| ) follows 
because R n R" is contained in a parallelogram of height H(R) and base 
H(R")u () 1 . This proves Lemma 

Lemma 14. Let L be a dyadic interval contained in Lri . If 

\Ri r\R"\ > Kr 1 !^/! 

R"eB(R'):ICl R „ 

then there does not exist R'" £ B(R') with Ir»' C /, Ir>" ^ /. 
Proof For every R'" G B(R') we have U(R'") C U and thus 

H{R"') < IW\I R ,»\ . 
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Hence if I R m C I then H(R"') < 20U\I\. The parallelogram R'" has been 
selected for Q after the parallelogram R and the parallelograms R" G B(R') 
with I C /r" . By Lemma [l3| it suffices to show that the maximal function 

M V (1 R + 1 R") 

R"eB(R'):ICl R „ 

is larger than 10 -3 on the parallelogram 

R ■ = 50(1 + pWllHiRy^R . 

First assume there exists R" G B(R') with I C I R » and Rj f) R" ^ and 
H{R") > 20|f7||J|. Note that U(R") and U(R) have non-empty intersection 
because U(R") C U C U(R). Applying Lemma |^ to the rectangles R" and 
i?/ we obtain similarly as before 

M V (1 R » + 1 R ) > 7- 1 iJ(^)- 1 (min( J H"(i?"),i?( J R)) + ii"(-R) > 1(T 3 

on Rj, which proves Lemma [i~4| in the given case. 
Hence we may assume 

H(R") < 20\U\\I\ 

for every R" G B(R') with I C Jr» and Ri n / 0. We then have on Rj 
by Lemma |l3] 

M v (l fl + ^ H(R)- 1 (H(R)+ H ( R "» 

R"eB(R'):ICI R n R"eB(R'):ICl R // 

> H(R)-\H(R)+ Yl iRinrt'lPlHiR)- 1 ) 

R"€B{R'y.ICl R „ 

> H(R)(H(R) + \U\H(R)- 1 IQ- 1 \R I \) ^WO- 1 . 

This completes the proof of Lemma |l4|. □ 

Note that we have used the hypothesis I R m ^ I of Lemma ( |14| ) only to 
conclude that R'" has been selected last to Q. Consider the collection of all 
R" G B(R') with / = and let R'" the parallelogram chosen last in this 
collection. Since \Rj n < \Ri\, the proof of the previous lemma also 
gives 

Lemma 15. We have for every I C I R > 

\RinR"\<2\R I \ . 

R"£B(R'):I=I R „ 

Now let I be the set of maximal dyadic intervals contained in I R such 
that 

^2 \Ri nR"\ > 2\Rj\ . 

R"eB(R'):ICl R /> 
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By Lemma 
By Lemma 



15 
14 



we have Iri £ X. Let I G I and denote the parent of I by /. 
and by maximality of I and Lemma Il5| we have 



Y \RiDR"\ = Y \RinR"\+ Y \RinR"\ 

R"eB(R') R"€B(R'):ICI R „ R"eB(R>):I=I R „ 

< 2\R } \ + 2|i?/| < 6|i?/| . 
By adding over all / G Z we obtain 

Y Y \Ri^R"\<6\Ri(R')\ ■ (3-16) 

I&X' R"eB(R') 

Now let I' be the set of maximal dyadic intervals which are contained 
in Iri, disjoint from any interval in I, and do not contain any I(R") with 
R" G IZ(R'). By construction of I we have for each / G I' 

Y \Rir\R"\ = Y \RinR"\<2\R I \ . 

R"EK(R') R"eK(R'):ICl R „ 
Summing over all intervals in I' gives 

Y Y \Ri^R"\<2\R(l(R')\ ■ (3-17) 
I el' R"elZ(R') 

Together with ( |3.16D this completes the proof of Lemma 11, because I and 
I' form a partition of I(R'). 

4. Outline of the proof of Theorem || 
Recall that we need to prove for each |A:| < ko the inequality 



\{H kjGjH l F ,l E )\ < ( ifiV " \F\^E\^ 



(4.1) 



We assume without loss of generality that E C G and F C H. Recall also 
that Theorem [2] implies for 1 < q < 00: 



\{H k l F ,l E )\ < 



E\ 
\F\ 



2 q , , 1 , 1 1 
\F 2 \E 2 



(4.2) 



The left hand sides of (4.1) and (4.2) are identical. Hence our task is to 
strengthen the proof of Theorem |2 in J2] in case the factor involving G and 
H in (4.1) is less than the corresponding factor involving E and F in (4.2). 

We recall some details about the proof in The form (Hk^-Fi^-E) is 
written as a linear combination of a bounded number of model forms 

/ J (C^fcljr, 1e) , 
seU k 

where the index set Uk is a set of parallelograms with vertical edges and 
constant height (depending on k). The paper proves the bound analogous 
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to (4.2) for the absolute sum 

\{C S)k l F ,l E )\ , (4.3) 

seW k 

where U' k is an arbitrary finite subset of U k and the bound is independent 
of the choice of subset, which may be assumed to only account for non-zero 
summands. 

To estimate (O), one first proves estimates for the sum over certain 
subsets of U' k called trees. Each tree T is assigned a parallelogram top(T). 
It is also assigned a density S(T) which measures the contribution of E to 
the tree, and a size cr(T) which measures the contribution of F to the tree. 
One obtains for each tree T: 

^2\{C s l F ,l E )\<5(T)a(T)\top(T)\ . 

The collection U' k is then written as a disjoint union of sub-collections Us iCr 
where 5 and a run through the set of integer powers of two. Each i/<s )CT is 
written as a disjoint union of a collection 7a i(7 of trees with density at most 
5 and size at most a. With the above tree estimate it remains to estimate 
E<5, CT s S,a with 

Ss,a ■= ^2 Sa \ to P( T )\ ■ 

TeT S:a 

We list the estimates on Sg >a used in J|; note that we include an additional 
factor of 5a relative to the corresponding expressions in |2|. 

Estimate 16 (Orthogonality). Ss,„ < \F\5a~ 1 . 

Estimate 17 (Density). Ss yf7 < \E\a . 

Estimate 18 (Maximal). For any e > 0, Ss, a < {F^lE^a^ . 

Estimate 19 (Trivial density restriction). If 5 > 1, then S$ )(T = . 

Estimate 20 (Trivial size restriction). There is a universal o~q such that if 
a > <7o, then Ss )(7 = . 

Our improvement comes through two additional estimates depending on 
G and H that will be proved in Section |5[ 

Estimate 21 (Second maximal). If p < 2 and G, H are as in Theorem^, 
then for every e > 

SvSw({§j)V<ri-< . 

Estimate 22 (Size restriction). Let p > 2 and G, H as in Theorem^. Let 
n > 2 be a large integer and a = 1/n and C a be some constant. Then there 
is a constant o\ such that if 
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with 



H 

\H\ 



l-a 



then we have Sg a = . 

To obtain summability for small a, it is convenient to take weighted geo- 
metric averages of Estimates 16, [l8|, and 21 with Estimate 17 to obtain 
positive powers of a. We record these modified estimates, where we simplify 
exponents using that we may assume universal upper bounds on 5 and a. 
We have for any e > 0: 

Estimate 23 (Modified Orthogonality). S s>a < \E\\ +e \F\l~ e 8^~ € a 2e . 

Estimate 24 (Modified maximal). S s , a < |F| 1_4e |£?| 4e o- e . 

Estimate 25 (Modified Second maximal). Under the assumptions of Esti- 



mate 21 



■v. t \f\ ( HI 



In the rest of this section we show how these estimates are used to estimate 
Ylscr $5,0 an d thereby complete the proof of Theorem ||. 

4.1. Case p < 2 and \H\ < \G\. Inequality (fO]) for § < p < 2 follows from 
inequality (|4.2j) for 1 < q < 2 unless 



\H[ 

W\ 

which we shall therefore assume. 

Pick e 
over 5 at 



i 

3 \F\ 

- W\ 



(4.4) 



Pick e > small compared to the distance of p to |. We split the sum 



, , \H\ \E\ 
° = ' W\W\ 



For 5 < 8o we use Estimate 23 together with Estimate 20 to obtain 



EEv < si 



\E 2 +e \F 2" 



, 3 , e , ,1 e 
\E 4 + 2 IF 4 _ 2 



<5<<5o °" 



|G| 



4 2 



For 5 > So we use Estimate 25 together with Estimate |20| to obtain 

\H\ 
W\ 



5>5o CT 



LB 4 IF 2 



|JET| 



Using ( [4.4| ) and < |G| we may estimate both partial sums by 

\H\\^ 



< \E\hF\^ 



\G\ 



This completes the proof of (4.1) in case p < 2. 
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4.2. Case p > 2 and \G\ < Pick e very small compared to ~. Inequal- 
ity fl4.1p for 2 < p < oo follows from inequality ([4.2[) unless 



< 



l+e 



(4.5) 



Iff I - vl^l 

which we shall therefore assume. Let a and 1/n be very small compared to 
e, let C a be as in the construction of the set H and let 5 be as in Estimate 
22, We split the sum over 5 at 

\S F 



For 5 < 5\ we use a weighted geometric mean of Estimates and 24 



together with Estimate 20 to obtain 

<5<<5i <t 



- 4e l£ , |2- £ |F|5 +e 



< $(l-*)(*-*0|£|*|F|* f M 

-HI 



-2c 



of 5 in Estimate 22 we may estimate the last display by 



where in the last line we have used (4.5) and |G| < \H\. Using the definition 

(4.6) 



< \E\2\F\2 



H 

\H\ 



-10e 



For 5 > 61 we use Estimate [I?] together with Estimate |22| to obtain 



<5><5i a 



5>5i 



< <5|F|< \F\2\E\2 



\G[ 
\H\ 



-10e 



where in the last line we have used 
proof of (|4.1|) in case p > 2. 



and |G| < This completes the 



5. Proof of the additional Estimates 21 and 22 



In this section we deviate from the notation in Section || as follows: for 
a parallelogram R we denote by CR the isotropically scaled parallelogram 
with the same center and slope as R but with height H(CR) = CH(R) and 
shadow I(CR) = CI{R). 

We say that a set is approximated by a parallelogram R, if it is contained 
in the parallelogram and the parallelogram has at most one hundred times 
the area of the set. Any parallelogram R can be approximated by a paral- 
lelogram R' with I(R') € T\ UI2 and both vertical edges of R' in X\ UZ2. 
To see this, first identify an interval I in X\ U 1% which contains I(R) and 
has at most three times the length; this interval / will be the shadow of R'. 
Consider the extension of R which has same central line and height as R 
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but shadow /. Then find two intervals in X\ U Z2 which have mutually equal 
length at most three times the height of R and which contain the respec- 
tive vertical edges of the extended parallelogram. These intervals define the 
vertical edges of R' . 

We recall some details of the proof of Estimate |l7] in [||. Given 5, a, 
one constructs a collection lZ$ ta of parallelograms of the same height as the 
parallelograms in UL such that each tree T in T$^ a is assigned a parallelogram 
R in IZs^ with top(T) C CqR and top(T') C CqR for every sub-tree T' of 
T, for some constant Co- If T(R) denotes the trees in Ts )CT which are assigned 
a given parallelogram R £ TZs,ai then we have 

E |top(T)| < C X \R\ 
TeT(R) 

for some constant C\ . Estimate [I?] is then deduced from the inequality 

E \ R \^\ E \*~ X • (s- 1 ) 

which follows essentially from pairwise incomparability of the parallelograms 
in TZg a . (In other words, if two parallelograms Pi, Pi overlap, then they 
are pointed in different directions, resulting in disjointness of the sets E(P\) 
and E(P2).) All parallelograms in lZg i(7 have height at least 2~ k °, length of 
shadow at least 2 _fc °, and slope at most 10 _1 . 

Let Q = [-N,N] 2 be the large square with N as in Lemma ||]. We claim 
that every set Q n 2 k R with R G K 5a and k > can be approximated 
by a parallelogram in is a parallelogram then this is 

clear by the remarks above. If Q n 2 k R is not a parallelogram, then we first 
extend it to the minimal parallelogram containing it, which thanks to the 
bounded slope of R is not much larger than Q n 2 k R, and then approximate 
the extension by a parallelogram in TZ± U 7^-2 • 

5.1. Proof or Estimate |2j| . We partition TZ$. a into subset T^s,a,j consisting 
of all parallelograms in 7Zs j(7 such that 

C^-i-^Rl < l to P( T )l < Ci2- j \R\ . 

TeT{R) 

We claim that TZ-s^j is empty unless j satisfies ( |5.3[) below. This claim 



together with ( |5.1|) will prove Estimate 21: 

JO<j T^S,a,j j0<j 



1 



H\\2 



It remains to prove the claim. Suppose there is a parallelogram R in 
R G 7ta,6,j- It has large density as defined and discussed in (2|, which 
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implies that there is a k > with 

|.E(2 fc .R) n G| > 2 20k 5\2 k R\ . 

Since G is contained in Q, we may approximate Q n 2 k R by a parallelogram 
R' of TZi U IZ2 and obtain 

\E(R')\ > \E(R') n G| > 2 20k 5\R'\ . (5.2) 

Now suppose first that 2 k > a~ e . By Claim 18 in and using that 
F C Q, we obtain 

IF n h n R'\ > \FnHn2 k R\ > 2 - 2 k 2 - j(7 i+e 



\R'\ ~ |2 fc 12| 

On the other hand, (5.2) implies in particular R' n G 7^ 0, which by con- 
struction of G (see Section |[) implies, using k > 0: 



2 -2* 2 - V+e < (2 20^ r (i+ e ) /l|iV 



1 

\H\\* 



2-1 < 2-1° := a-L-<8-2-< ( |_I ) . (5.3) 



If 2 k < a e we use the variant 

IF n n n o-- e R\ 



> 2~V +3e 



\a- e R\ 

of Claim 18 in to obtain the same conclusion. 



5.2. Proof of Estimate 22. Note that by Estimates 19 and 20 we may 



assume CqS < 6 with Co as above. Suppose Ts i<7 is non-empty. Consider a 
tree T in Ts 1<r and let R £ 7£<5 jCr be the associated parallelogram as above. 
As above we have for some k > 0: 

|F(2 fc F) RG| > 2 m 5|2 fc 12| . 

Define m so that (5 is within a factor two of CQ2 m 5 and note that m > 0. 
Let 12' G 7?4 U 7£ 2 be an approximation of Q n max(2 fc , G 2 m )F. We then 
have 

\E(R')nG\ > 5\R'\ . 
By construction, R' is disjoint from H. Since top(T) is contained in Gol2, 
we have that 2 m top(T) is contained in 12' U Q c , and the same holds with T 
replaced by any sub-tree T" of T. 

But by Lemma 29 of Q with / = \fc\H, we obtain with the notation in 
that Lemma for every sub-tree T" of T: 

^ K/;^s)| 2 = XT l^ 1 2 m '+ 1 top(T')\2™'top(T')'^ s )| 2 

sST' m'>ms£T' 



< \ r>— 4nm' 11 j?i i|2 
~ H/ i 2 m '+ 1 top(T')ll2 



m'>m 
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< 2- 2 " m |top(T')| . 
By the definition of cr(T) this implies 

<r{T) < T nm , 
which in turn implies Estimate |22j . 
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